Accelerating recommendation system training by leveraging popular choices

نویسندگان

چکیده

Recommender models are commonly used to suggest relevant items a user for e-commerce and online advertisement-based applications. These use massive embedding tables store numerical representation of items' users' categorical variables (memory intensive) employ neural networks (compute generate final recommendations. Training these large-scale recommendation is evolving require increasing data compute resources. The highly parallel portion can benefit from GPU acceleration however, large often cannot fit in the limited-capacity device memory. Hence, this paper deep dives into semantics training obtains insights about feature access, transfer, usage patterns models. We observe that, due popularity certain inputs, accesses embeddings skewed with few entries being accessed up 10000X more. This leverages asymmetrical access pattern offer framework, called FAE, proposes hot-embedding aware layout recommender utilizes scarce memory storing embeddings, thus reduces transfers CPU GPU. At same time, FAE engages accelerate executions hot entries. Experiments on production-scale real datasets show that overall time by 2.3X 1.52X comparison XDL CPU-only CPU-GPU execution while maintaining baseline accuracy.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

QoS-based Web Service Recommendation using Popular-dependent Collaborative Filtering

Since, most of the organizations present their services electronically, the number of functionally-equivalent web services is increasing as well as the number of users that employ those web services. Consequently, plenty of information is generated by the users and the web services that lead to the users be in trouble in finding their appropriate web services. Therefore, it is required to provi...

متن کامل

Accelerating the Composite Power System Planning by Benders Decomposition

This paper presents an application of Benders decomposition to deal with the complexities in the simultaneous Generation Expansion Planning (GEP) and Transmission Expansion Planning (TEP). Unlike the power system operation fields of study, the power system planning methods are not expected to be fast. However, it is always preferable to speed up computations to provide more analysis options for...

متن کامل

Recommendation System Leveraging Heterogeneity of Trust in Social Networks

Recommendation systems are an integral part of many online services ranging from content streaming websites like Netflix to online shopping websites like Amazon. They improve the online experience by suggesting new products that match users’ interests and preferences. Collaborative filtering [4] methods are the most commonly used techniques in recommendation systems which rely on collection and...

متن کامل

Discovering Popular Clicks\' Pattern of Teen Users for Query Recommendation

Search engines are still the most important gates for information search in internet. In this regard, providing the best response in the shortest time possible to the user's request is still desired. Normally, search engines are designed for adults and few policies have been employed considering teen users. Teen users are more biased in clicking the results list than are adult users. This leads...

متن کامل

Accelerating partial order planners by improving plan and goal choices

We describe some simple domain-independent improvements to plan-reenement strategies for well-founded partial order planning that promise to bring this style of planning closer to practicality. One suggestion concerns the strategy for selecting plans for reenement among the current (incomplete) candidate plans. We propose an A* heuristic that counts only steps and open conditions, while ignorin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the VLDB Endowment

سال: 2021

ISSN: ['2150-8097']

DOI: https://doi.org/10.14778/3485450.3485462